Learning Spike-Based Population Codes by Reward and Population Feedback
نویسندگان
چکیده
We investigate a recently proposed model for decision learning in a population of spiking neurons where synaptic plasticity is modulated by a population signal in addition to reward feedback. For the basic model, binary population decision making based on spike/no-spike coding, a detailed computational analysis is given about how learning performance depends on population size and task complexity. Next, we extend the basic model to n-ary decision making and show that it can also be used in conjunction with other population codes such as rate or even latency coding.
منابع مشابه
Ventral pallidal representation of pavlovian cues and reward: population and rate codes.
We recorded neural activity in the ventral pallidum (VP) while rats learned a pavlovian reward association. Rats learned to distinguish a tone that predicted sucrose pellets (CS+) from a different tone that predicted nothing (CS-). Many VP units became responsive to CS+, but few units responded to CS-. When two CS+ were encountered sequentially, the earliest predictor of reward became most pote...
متن کاملCode-Specific Learning Rules Improve Action Selection by Populations of Spiking Neurons
Population coding is widely regarded as a key mechanism for achieving reliable behavioral decisions. We previously introduced reinforcement learning for population-based decision making by spiking neurons. Here we generalize population reinforcement learning to spike-based plasticity rules that take account of the postsynaptic neural code. We consider spike/no-spike, spike count and spike laten...
متن کاملReinforcement learning based feedback control of tumor growth by limiting maximum chemo-drug dose using fuzzy logic
In this paper, a model-free reinforcement learning-based controller is designed to extract a treatment protocol because the design of a model-based controller is complex due to the highly nonlinear dynamics of cancer. The Q-learning algorithm is used to develop an optimal controller for cancer chemotherapy drug dosing. In the Q-learning algorithm, each entry of the Q-table is updated using data...
متن کاملSpike-Based Reinforcement Learning in Continuous State and Action Space: When Policy Gradient Methods Fail
Changes of synaptic connections between neurons are thought to be the physiological basis of learning. These changes can be gated by neuromodulators that encode the presence of reward. We study a family of reward-modulated synaptic learning rules for spiking neurons on a learning task in continuous space inspired by the Morris Water maze. The synaptic update rule modifies the release probabilit...
متن کاملReinforcement Learning in Continuous State and Action Space
To solve complex navigation tasks, autonomous agents such as rats or mobile robots often employ spatial representations. These “maps” can be used for localisation and navigation. We propose a model for spatial learning and navigation based on reinforcement learning. The state space is represented by a population of hippocampal place cells whereas a large number of locomotor neurons in nucleus a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Neural computation
دوره 22 7 شماره
صفحات -
تاریخ انتشار 2010